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ABSTRACT 

The statistical tools needed to obtain a mass function from realistic collapse time es- 
timates are presented. Collapse dynamics has been dealt with in paper I of this series 
by means of the powerful Lagrangian perturbation theory and the simple ellipsoidal 
collapse model. The basic quantity considered here is the inverse collapse time F; it 
is a non-linear functional of the initial potential, with a non-Gaussian distribution. In 
the case of sharp fc-space smoothing, it is demonstrated that the fraction of collapsed 
mass can be determined by extending to the F process the diffusion formalism intro- 
duced by Bond et al. (1991). The problem is then reduced to a random walk with 
a moving absorbing barrier, and numerically solved; an accurate analytical fit, valid 
for small and moderate resolutions, is found. For Gaussian smoothing, the F trajec- 
tories are strongly correlated in resolution. In this case, an approximation proposed 
by Peacock & Heavens (1990) can be used to determine the mass functions. Gaussian 
smoothing is preferred, as it optimizes the performances of dynamical predictions and 
stabilizes the F trajectories. The relation between resolution and mass is treated at 
a heuristic level, and the consequences of this approximation are discussed. The re- 
sulting mass functions, compared to the classical Press & Schechter (1974) one, are 
shifted toward large masses (confirming the findings of Monaco 1995), and tend to 
give more intermediate-mass objects at the expense of small-mass objects. However, 
the small-mass part of the mass function, which depends on uncertain dynamics and is 
likely to be affected by uncertainties in the resolution-mass relation, is not considered 
a robust prediction of this theory. 
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1 INTRODUCTION 

An important outcome of any cosmological model is the 
mass distribution of those cosmic collapsed structures which 
are predicted to form; this quantity is usually called mass 
function (hereafter MF) or multiplicity function. The theo- 
retical determination of this quantity is difficult, as cosmo- 
logical collapsed structures are the sites of non-linear gravi- 
tational dynamics. No exact analytical solution of the non- 
linear collapse of a general self-gravitating system is known. 
The first attempt to determine the number of collapsed ob- 
jects was made by Press & Schechter (1974; hereafter PS); 
to predict the collapse of a mass clump, they used a heuris- 
tic argument based on the extrapolation of linear theory to 
the highly non-linear regime, and on the spherical collapse 
model, whose solutions are analytically known. Since PS, 
most works on the MF have been based on similar dynami- 
cal arguments. 



However, a number of dynamical approximations have 
recently been developed. These approximations provide a 
reasonable description of the collapse of a self-gravitating, 
pressure-less fluid up to caustic formation, when the or- 
bits of different mass elements cross each other (orbit cross- 
ing, hereafter OC, or shell crossing). In a previous paper 
(Monaco 1995, hereafter M95), the effects of non-spherical 
collapse were analyzed in a PS-like approach; in that case, 
the fraction of collapsed mass was identified with the prob- 
ability of having suitable initial conditions such as to make 
a mass element collapse. The dynamical tools used in that 
case were the Zel'dovich approximation (Zel'dovich 1970) 
and the homogeneous ellipsoid collapse model. 

This paper is the second in a series in which a new 
theory for the MF of cosmic structures is constructed. The 
idea, contained in M95, of constructing an MF based on re- 
alistic collapse dynamics, is the basis of the whole theory. 
The first paper of this series (Monaco 1996, hereafter paper 
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I) develops the dynamical tools needed to obtain an MF. 
As already noted by M95, the MF dynamical problem (in 
its fluid limit) is intrinsically Lagrangian, in the sense that 
it is best faced within a Lagrangian fluidodynamical frame- 
work. Thus, all the tools of the Lagrangian formulation of 
gravitational dynamics can be used: the Zel'dovich approx- 
imation, Lagrangian perturbation theory (see, e.g, Bouchet 
et al. 1995; Buchert 1994; Catelan 1995; complete references 
are given in paper I), and the ellipsoidal collapse model. In 
paper I, all these dynamical approximations are analyzed, 
with the following results: 

(i) As in M95, the collapse of a mass element is iden- 
tified with the OC instant; this definition has been amply 
discussed. 

(ii) Lagrangian perturbations are applied to smoothed 
versions of the initial potential; it is assumed that smaller 
scales do not influence the collapse significantly. 

(iii) The Lagrangian series, up to third order, converges 
in predicting the collapse of a homogeneous ellipsoid. 

(iv) As a consequence, the collapse time of a homogeneous 
ellipsoid can be estimated in an easy and fast way by means 
of the third-order Lagrangian series, with a correction for 
quasi-spherical collapses. 

(v) Ellipsoidal collapse can be seen as a particular trunca- 
tion of the Lagrangian series, when all the more than second 
derivatives of the initial peculiar gravitational potential are 
neglected. 

(vi) In the general case of a Gaussian field with scale- 
free power spectrum, the Lagrangian series is shown to con- 
verge in predicting the collapse of a mass element, when 
fast-collapsing mass elements, representing at least 10 per 
cent of the mass, are taken into account. Convergence is 
valid at a qualitative level for about 50 per cent of the mass. 

(vii) The homogeneous ellipsoidal collapse correctly re- 
produces the collapse time prediction of the Lagrangian se- 
ries in the same convergence range. 

The main quantitative outcome of paper I is the prob- 
ability distribution function (hereafter PDF) of the inverse 
collapse times F (in the spherical collapse case F is just 
proportional to the density contrast). These are preferred 
to the collapse times as their distribution is better behaved: 
the inverse collapse times are large, but of order one, for 
fast collapsing points, and become smaller and smaller for 
slowly collapsing ones; non-collapsing points have vanishing 
or negative F values. 

This paper, paper II, faces the problem of finding an MF 
from the PDF of the inverse collapse times. The statistics 
needed to 'count' the objects, which form according to the 
dynamical predictions used in paper I, is developed, and the 
resulting MFs are analyzed and commented. A comparison 
of the whole theory to N-body simulations is underway (pa- 
per III). This paper is organized as follows: Section 2 con- 
tains an overview of the statistical procedures which have 
been developed by previous authors to obtain an MF; this 
is useful so as better to define the strategy for the statistical 
procedure to develop. In Section 3 the fraction of collapsed 
mass, as a function of the resolution, is determined for sharp 
fc-space filtering. This quantity can be determined by solv- 
ing a multi- (infinite-) dimensional diffusion process with an 
absorbing barrier; however, it is numerically shown that the 
same solution can be found by finding the first upcrossing 



rate of the F process alone, as if F were a Markov process; 
the precise behaviour of the F process as a function of res- 
olution is left as an open problem. Then, the determination 
of the MF can be reduced to a diffusion problem with a 
moving absorbing barrier. Numerical solutions and accurate 
analytical approximations for the fraction of collapsed mass 
are presented. In Section 4 the Gaussian smoothing case is 
analyzed. In this case the collapse time trajectories present 
strong correlations in resolution, which considerably com- 
plicate the problem. The results obtained from numerical 
simulations of individual trajectories are successfully com- 
pared to a reasonable approximation proposed by Peacock & 
Heavens (1990). The passage from the resolution variable to 
the mass variable is discussed in Section 5: a 'missing piece' 
is identified, related to the mass distribution of the extended 
collapsed structures. Section 6 contains a summary of the 
main results of the paper and some final comments. 



2 STATISTICS AND THE MF: AN OVERVIEW 

Once a collapse condition is given for every point, and its sta- 
tistical properties, namely its PDF, are known, a statistical 
framework has to be developed, to 'count' the number of col- 
lapsed structures. The main statistical procedures proposed 
in the literature are reviewed in the following. Throughout 
the paper the following notation will be used: the variable 
A = <xf, equal to the mass variance, will be used as the 
resolution variable, F will denote the inverse collapse time, 
and its PDF will be denoted as Pf(F,A); in this Section, F 
is generally equal to <5;/<5 c , where Si is the linearly extrap- 
olated density contrast and S c is a threshold. The growing 
mode b{i) will be used as the physical time variable; it is 
demonstrated in paper I that this choice makes it possi- 
ble to parameterize out the dependence of dynamics on the 
background cosmology with a great accuracy. The integral 
MF, i.e. the fraction of collapsed mass, will be denoted by 
Q(< A) or Q(> M), whether it is considered as a function of 
the resolution A or of the mass M. Finally, the differential 
MF, the number of objects per unit volume and unit mass 
interval, will be denoted by n(M), while n(A) will denote 
the A derivative of S7(< A) (see Section 5 for the relation 
between the two n functions). 

In the seminal PS paper, a point was supposed to col- 
lapse whenever its linearly extrapolated density contrast, 
smoothed at a scale -R(A) with some filter, exceeded a given 
threshold 8 C (equal to 1.69 if the spherical collapse model 
was invoked). Thus, the fraction of collapsed mass at a given 
resolution was: 

P oo 

0(<A)= / Pf(F, A)dF ; (1) 
Jf c 

(F c =l if F = Si/S c ). To get the MF as a function of the mass, 
M was set equal to the mass contained by the smoothing 
filter; in the top-hat filter case, M = 47rpi?(A) 3 /3. When the 
filter is not a top-hat in real space, one can again obtain a 
relation between A and M, even though its physical meaning 
is less clear (see, e.g., Bond et al. 1991). 

One of the weaknesses of this statistical approach was 
immediately clear: for any threshold S c of order one, the in- 
tegral in equation (Q) tends to 1/2 as A becomes very large, 
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so that just one half of all the mass in the Universe is pre- 
dicted to collapse; PS overcame this normalization problem 
by multiplying their MF by a 'fudge' factor of 2. This 1/2 
factor is a signature of linear theory, which predicts that 
only initially overdense regions (one half of the total mass) 
are able to collapse. On the other hand, the lack of nor- 
malization of the PS MF is caused by the incorrect way of 
counting collapsed points. As a matter of fact, a collapse pre- 
diction is given to each point for any resolution A; in other 
words, a whole trajectory in the F—A plane is assigned to 
each point. At small A values (large R), all trajectories lie 
below the threshold, but when A grows the trajectories can 
upcross or downcross the threshold. The first upcrossing is 
related to the collapse of the point at that scale; any further 
downcrossing is meaningless: a point which is collapsed at a 
scale R (and is included in a region of size > R) cannot be 
considered as not collapsed at a smaller scale R' < R (and 
not included in any collapsed region of size >R'). This fact 
was first recognized and solved by Epstein (1983), then by 
Peacock & Heavens (1990; hereafter PH) and by Bond et al. 
(1991; hereafter BCEK). To correct for this cloud-in- cloud 
problem, the integral MF has to be related to the proba- 
bility Pp° up (F, A) that a trajectory has never upcrossed the 
threshold: 



0(< A) 



pnoup 
F 



(F, A)dF . 



(2) 



The determination of Pp° up (F, A) is generally difficult: 
Pp° up (F, A) is the conditional probability of F being > F c 
at A, given F < F c at all smaller A values, so that all the 
N-point correlations of F at different A have to be consid- 
ered. The problem is greatly simplified if the filter function 
is a top-hat in the Fourier space (sharp fc-space filtering, 
hereafter SKS; see BCEK). In this case, provided the initial 
density field is a Gaussian process, independent modes are 
added as the resolution changes, and the trajectories in the 
F—A plane are random walks (F is a Markov process, in par- 
ticular a Wiener process; see Section 3 for further details). 
Thus, the first upcrossing problem is equivalent to a ran- 
dom walk with a (fixed) absorbing barrier. The PDF of the 
trajectories obeys a Fokker-Planck equation (see equation 



0) 



^pr p (F,A), 



(3) 



with the boundary condition P 
is: 



P F oup (F,A)dF - 



noup 

F 



(F C ,A) = 0. The solution 



(4) 





F 2 \ 




exp y 


2A J 


- exp ^ 



(F-2F C 
2A 



dF . 



With this solution, the resulting MF is the original PS one, 
including the 'fudge' factor 2: 



n(A)dA = F c exp(-F c 2 /2A) / V2yrA 3 dA 



(•») 



If the filter is not SKS, the F trajectories are not random 
walks (F is no longer a Markov process); they are affected 
by strong correlations, and no Fokker-Planck equation can 
be written for their PDF. In this case, the differential MF re- 
duces to that of PS without the factor of 2 at large masses, 



and has a different slope at small masses; this was shown 
both by PH and by BCEK. Non-SKS filters are very difficult 
to deal with in this framework, as all the N-point correlation 
functions are relevant in determining the statistical proper- 
ties of the trajectories. However, PH found a reasonable and 
successful way to approximate the Pp° up (F, A) distribution; 
it will be described in Section 4. 

The powerful and elegant diffusion formalism has been 
used to find the merging histories of dark-matter clumps, 
as the solutions of a two-barrier diffusion problem (BCEK; 
Lacey & Cole 1993); Bower (1991) obtained the same results 
as extensions of the original PS work, without using the 
diffusion formalism. Lacey & Cole (1994) made a number of 
empirical choices to optimize the adherence of their formulae 
to N-body simulations: they took the SKS solution, which 
coincides with the original PS solution, and is easier to deal 
with, and let two parameters vary, namely the threshold 
parameter 5 C and the mass assigned to the filter; with these 
choices, they succeeded in fitting the abundance and the 
merging histories of simulated friends-of-friends dark halos. 

Another important problem of the PS statistical pro- 
cedure was outlined by Blanchard, Valls-Gabaud & Mamon 
(1992) and Yano, Nagashima & Gouda (1996) (even though 
the same problem had already been faced in Epstein 1983). 
If the spherical top-hat model is used to predict the collapse 
time, and if top-hat smoothing is consistently used, then the 
density contrast in a point has to be interpreted as a mean 
over a spherical volume of radius R. As a consequence, a 
collapse prediction has to be considered as relative not sim- 
ply to the point, but to the whole spherical region which 
surrounds it. The collapse condition has then to be changed 
to the following: a point collapses if it is embedded in a 
collapsing region, even though it is not at its center (and 
then its smoothed density contrast can be smaller than the 
threshold) . In paper I this kind of reasoning has been called 
global interpretation of the collapse time. This new collapse 
condition considerably complicates the statistical problem; 
Blanchard et al. (1992) have shown this to cause a flatten- 
ing of the MF with respect to the PS one, while Yano et al. 
(1996) have faced the problem by explicitly introducing the 
two-point correlation function in the collapse condition. 

The PS procedure was extended in a number of other 
papers. In particular, Lucchin & Matarrese (1988) extended 
the PS formalism to non-Gaussian density fields and Lilje 
(1992) to general cosmologies. Porciani et al. (1996) tried 
to introduce non-Gaussianity in the diffusion framework by 
reflecting all the trajectories crossing 8i = — 1, to avoid un- 
physical negative densities; in this way they found an in- 
triguing cutoff of the MF at small masses. The same dynam- 
ical content as in the PS approach is present in the block 
model of Cole & Kaiser (1989) and in the Monte Carlo ap- 
proach of Rodriguez & Thomas (1996). 

In the PS framework, the relevant objects to be con- 
sidered are the sets of points whose F upcrosses a given 
threshold; these sets are usually called excursion sets. Al- 
ternatively, one can suppose that the forming structures are 
not connected to general excursion sets, but to the peaks of 
the initial density contrast (see, e.g., Bardeen et al. 1986). 
Then, counting structures is reduced to counting the num- 
ber of peaks above a given threshold. With respect to the 
excursion set approach, the peak approach has a great ad- 
vantage, namely that the extended structures which are to 
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be counted are well defined, clearly connected to the peaks 
of the density field. However, the peak approach has a num- 
ber of disadvantages: (i) going from excursion sets to peaks 
greatly complicates the formalism, as the peak constraint 
is much stronger than a simple threshold constraint; as a 
consequence, an analytical determination of the number of 
peaks is hopeless if F is not a Gaussian or closely related pro- 
cess (except for high peaks; see Catelan, Lucchin & Matar- 
rese 1988); (ii) obtaining an MF from the number of peaks 
requires an estimate of the mass associated to a peak. Dif- 
ferent reasonable choices lead to different MFs (e.g., Ryden 
1988; Colafrancesco, Lucchin & Matarrese 1989; PH; Cava- 
liere, Colafrancesco & Scaramella 1991); (iii) it is very diffi- 
cult to solve the 'peak-in-peak' problem; this has been done 
by PH in a heuristic way, then by Appel & Jones (1990) 
and Manrique & Salvador-Sole (1995, 1996), and by Bond & 
Myers (1996) within their peak-patch theory; (iv) the peaks 
of the initial density field are generally not the first points 
to collapse; this has been shown by means of both theo- 
retical arguments (see, e.g., Shandarin & Zel'dovich 1989: 
in the Zel'dovich approximations, structures are connected 
with the peaks of Ai, the largest eigenvalue of the deforma- 
tion tensor) and numerical simulations (Katz, Quinn & Gelb 
1993; van de Weigaert & Babul 1994). 

A different approach, pioneered by Silk & White (1978) 
and Lucchin (1989), was used by Cavaliere and coworkers; 
see Cavaliere, Menci & Tozzi (1994) for a recent review and 
paper I for further comments. In their case, the existence of 
an MF is implicit, and kinetic evolution equations are given 
for its evolution; this is at variance with the PS and related 
approaches, where the MF is obtained from the evolution 
of the density contrast field. Besides, in their application of 
the Cayley tree formalism to the adhesion approximation 
(Cavaliere, Menci & Tozzi 1996; see also Vergassola et al. 
1995), the objects were identified with shocks in the collaps- 
ing medium. 

To determine the MF from the statistics of the F pro- 
cess, the excursion set approach has been chosen. It will be 
shown in the following that, in the SKS smoothing case, 
despite the complicate non-Gaussian nature of F, the MF 
problem can be recast in terms of a random walk with a 
moving absorbing barrier, while in the Gaussian filter case 
the useful approximation proposed by PH applies. A peak 
approach applied to the process F would better take into ac- 
count the complex geometry of the collapsed regions in the 
Lagrangian space, but, as mentioned before, it is intractable 
from the analytical point of view. 

Some remarks about the F process, already outlined in 
paper I, are worth stressing again: 

(i) As F is (the inverse of) a time, any threshold F c in 
this theory simply specifies the time at which the MF is 
examined; there is no free 5 C parameter. 

(ii) Smoothing is necessary because of the truncated na- 
ture of the dynamical approximations used. Thus the shape 
of the filter has to be chosen in order to optimize the per- 
formances of the dynamical predictions; usually Gaussian 
filters are preferred. 

(iii) The dynamical predictions are strictly punctual; in 
other words, a point collapses only if it is predicted to col- 
lapse (at a given scale), not if a neighboring collapsing point 
is able to involve it in its collapse. 



Point (i) is a result of the more detailed dynamics con- 
tained in this MF theory. Point (ii) implies that, even though 
we still have some freedom in choosing the shape of the 
filter, the Gaussian filter case has to be regarded as the 
most "physical" one. Point (iii) is connected to the discus- 
sion about the global interpretation of collapse times: in 
the Lagrangian perturbation case dynamical predictions are 
punctual, then the complications introduced by the global 
interpretation are avoided. Moreover, spherical symmetry of 
collapsed regions in Lagrangian space is not imposed, as it 
happened with the global interpretation of spherical collapse 
time. On the other hand, the exact size of the collapsed re- 
gions cannot be so easily determined as in the spherical case: 
as a consequence of the coherence of the underlying initial 
field, a point collapsed at a scale i?(A) will be part of a struc- 
ture of typical size R (or larger) , but the exact determination 
of its size requires knowledge of the space correlations of the 
process F. In other words, the difficulties skipped in the dif- 
fusion problem, thanks to the punctual interpretation of the 
collapse time, are found again in the A — > M transformation. 
This will be analyzed in Section 5. 

Before going on, it is worth commenting on the physical 
meaning of the absorbing barrier in the Lagrangian dynam- 
ical context. The nature of the dynamical prediction is such 
that most mass is predicted to collapse at small scales (92 
per cent according, for instance, to the Zel'dovich approxi- 
mation); the exact number is not easy to determine, as the 
behavior of the strongest underdensities is not well predicted 
by Lagrangian perturbation theory (see paper I). Anyway, 
it is unlikely that about 10 per cent of mass in the strongest 
underdensities is going to affect the MF in any observable 
mass range. Thus, the diffusion formalism is not needed here 
just to ensure a correct normalization, which is more or less 
guaranteed, but to solve the cloud-in-cloud inconsistency of 
the original PS procedure, which in this context has the fol- 
lowing meaning. As the power spectrum has no (or a very 
small) intrinsic truncation, a collapse prediction is assigned 
for every resolution to every point; these collapse predictions 
all have, in principle, the same validity. Recalling that col- 
lapse means to be enveloped in an OC region, it is natural 
to expect any point to be in OC at a small enough scale, 
and to be in single-stream regime at a large enough scale. 
The PS statistical procedure would suffice to find n(A) if 
the transition to OC occurred only once, i.e. if the process 
F never downcrossed the F c barrier. A downcrossing of F 
has the following meaning: a point is in OC at a large scale, 
but it is not at a smaller scale; this appears contradictory, 
as a collapsed structure does not contain non-collapsed sub- 
clumps. To overcome this inconsistency, it is assumed that 
OC at one scale implies OC at all smaller scales. This cor- 
responds to absorbing the trajectories of the F process that 
upcross the F c barrier. 



3 SHARP A-SPACE SMOOTHING 

The quantity F(A) is a non-linear and non-local functional 
of the initial (peculiar rescaled) potential ip(<\\ A). Then, it is 
not trivial to infer the properties of the F process as a func- 
tion of A. In the following, it will be shown that the multi- 
infinite-) dimensional process, defined by all the F and <p 
values in all the points, is a Markov process, so that the 
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statistics of first upcrossing of F can in principle be found by 
solving a multi- (infinite-) dimensional Fokker-Planck (here- 
after FP) equation. It will also be shown that the same first 
upcrossing statistics can be reproduced by studying a (one 
dimensional) Markov process with the same PDF as the F 
process. 

It is opportune, before going on, to introduce a num- 
ber of definitions which will be necessary in the following; 
a more detailed presentation can be found, for instance, in 
the textbooks by Arnold (1973) and Risken (1989). 

Let f(A) be a real stochastic process. Let's denote by 
x the value that the process £ takes at the resolution A. 
The process £ is said to possess the Markov property if its 
history at resolutions > A is determined by its value x at 
A, independently of its past history; in other words, a pro- 
cess possesses the Markov property if past and future are 
independent once the present is known. 

A Markov process is called a diffusion procesqrj if its 
trajectories are continuous and if the quantities Lr^> and 
D (2) exist: 

(f(A + dA) - £(A)) = D w dA + 0(dA 2 ) (6) 
((C(A + dA)-C(A)) 2 ) = D {2) dA + 0(dA 2 ) . 

The coefficients D (1) and L> (2) are called drift and diffusion 
coefficients. The PDF P x (x,A) of a diffusion process obeys 
a FP equation: 

d 



as functions of A, Wiener processes, with nonvanishing mu- 
tual correlations at fixed resolutions. Such processes con- 
stitute a 3D diffusion process; the evolution of their joint 
PDF is then controlled by a 3D FP equation with vanishing 
drift and constant diffusion (D[ 2w ^) coefficients. Collapse is 



assumed to have place if: 

F= X 1 (w 1 (A),w 2 (A),w 3 (A)) > F c 



(9) 



Then, the first upcrossing rate problem can be formulated in 
term of a 3D diffusion with a complicate absorbing barrier. 

On the other hand, the system {Ai,A2,u>3} can be ob- 
tained by means of a non-linear transformation of the {wt} 
system; it is easy to verify that such transformation is in- 
vertible and twice differentiable in any point, except in the 
line Wi — u>2 and — 0, a set of zero measure which is 
explicitly avoided by trajectories (as in the 3D case; see the 
joint PDF of eigenvalues, e.g., in M95). In this case, the {A;} 
(with A3 = W3) system is a Markov process, and, more pre- 
cisely, a diffusion process (Arnold 1973; Risken 1989). The 
drift and diffusion coefficients can be found by means of the 
following transformation rules: 



dP = 



d 2 \ 



-D 



D 



(2) 



dvjjdwk 3 
dA, d\ 



(2w) 



(10) 



dw k dwi 



D 



(2w) 



OA 



P*(x,A) 



|;Z><'W> + i|><*,A) 



A diffusion process with D' 1 ' = and D*- 2 ' = 1 is 
called a Wiener process, W(A). Its increments are uncorre- 
cted: (dW(A)dW(A')) = S D (A- A') (S D is the Dirac delta 
function), while (W(A)W(A')) = min(A, A'). The transition 
probability from A to A' is a Gaussian with variance (A— A'), 
and the PDF at resolution A is a Gaussian with variance A 
and zero mean. All these definitions can be easily extended 
to the case of multi-dimensional processes. 

In order to simplify the notation, in the following a 
stochastic process and its numerical values, although dif- 
ferent mathematical objects, will be denoted by the same 
symbol. 

3.1 A FP Equation for P F (F, A) 

It is useful to analyze first, as an example, the simple case of 
Zeldovich approximation in 2D. Initial conditions are given 
by the three independent component of the first-order de- 
formation tensor (a symmetric 2D matrix), wi, 102 and W3, 
the third one being the non-diagonal component. The two 
eigenvalues of the deformation tensor, Ai and A2, are simply: 

Ai, 2 = i (wi + W2 ± \Z(W! - w 2 ) 2 + iw 2 ^ , (8) 

where the + sign corresponds to the largest eigenvalue Ai = 
F. 

The three w% components, being linearly connected to 
the initial potential tp, are correlated Gaussian variables and, 



* Note that some authors reserve the name diffusion only to those 
processes which have constant drift and diffusion coefficients. 



Then, the problem can be reformulated by means of a more 
complicate FP equation with the simple barrier F = F c . 
Px(x, ) .(7) The same considerations can be generalized to any suffi- 
ciently well-behaved inverse collapse time F in 3D. Let y(q) 
be a Gaussian process in the Lagrangian space q (see paper 
I for the definition of Eulerian and Lagrangian coordinates) . 
tp is the initial peculiar rescaled gravitational potential, de- 
fined by the equation: 



V>(q) =*,(q) 



(11) 



where <5;(q) = <5o(q)/&o is the linear density contrast (<5o is 
the initial density contrast and bo — ao is the initial growing 
mode, nearly equal to the initial scale factor) . This Gaussian 
process is assumed to have power at all relevant scales, up to 
a very small cutoff. From the field tp a hierarchy of smoothed 
fields is obtained: 



¥>(q) ->■ <^(q; A) = tp(q) * W(q, A) 



(12) 



where * denotes a convolution, and W^q, A) is the SKS filter, 
which in the Fourier space is 6(1 — kR(A)). 

The inverse collapse time -F(q, A) of a point is a func- 
tional acting on the tp potential: 



F(q,A) =& c - 1 (q,A) = J- q [^(q',A)] 



(13) 



It is important to note that the functional T does not act 
directly on the A variable; in other words, it does not shuf- 
fle information coming from different resolutions. In many 
cases it is not possible to obtain the explicit form of this 
functional (see paper I for full details). Anyway, it will al- 
ways be assumed that it is twice differentiable with respect 
to the tp values, except in some possible degenerate cases, 
which constitute a set of zero measure in Lagrangian space. 

It is convenient, in order to handle the functional T in- 
troduced above, to consider discrete spaces; all the consid- 
erations which follow will be valid in the continuum limit. 
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Let's then consider the (Lagrangian) space divided into a 
large number N of points {qj}: 



F(q,A) - F(qi,A) =Fi(A) 
tp(q,A) -> (p(qi,A) = tpi(A) 
*,b(q',A)] - ^ qi (Mcu,A)})=^({^(A)}) 



(14) 



In this way the functional becomes an ordinary (non-linear) 
function of the TV resolution-dependent variables {tpi(A)}. 

The evolution equations of the {tpi} variables are those 
of a vector Wiener process. They can be written as follows: 



dfi = fij(A)dWj 



(15) 



(summation over repeated indexes is meant), where {dW j} 
are N independent Wiener processes. The /y (A) coefficients 
are such as to give the correct variances and spatial correla- 
tions of tp. Note the use of the differential notation in equa- 
tion @, which is common in stochastic mathematics, as 
time derivatives are ill-defined in this context. 

It is possible to find evolution equations for the func- 
tions Fi(A). With a chain-rule differentiation, assuming 
twice differentiability, the following system is obtained: 



dT 

dFi = —^{{tp})dtpj 



(16) 



difii 



fij(A)dWj 



This is a non-linear Langevin system. As mentioned above, 
this implies that the whole system {F, tp} is a Markov pro- 
cess; provided the regularity conditions given above (eq. ^ 
are satisfied, the system is a diffusion process (see Arnold 
1973 and Risken 1989 for a complete demonstration) . Then, 
the evolution of P{ p tlp \ , the joint PDF of the F and tp values 
in all the points, is given by a FP equation: 



{y,} 



dA 



_d_ 

dyi 



D?\{yi}) + 



2d yi d yj » 



(2) 



({!«}) 



(17) 



where denote the components of the {F, tp} vector. To find 
the first upcrossing rate, e.g., of the F\ process (evaluated 
at qi) above the barrier F c , such equation has to be solved 
with the constraint P{f,<p}(Fi = F c , . . .) = 0. 

Provided the {tp} — > {F} transformation is a well de- 
fined and invertible one, the {F} system is a Markov process 
by itself, and the problem can be formulated in terms of the 
{F} system only. 

A solution of such a multi-dimensional problem, while 
revealing the Markov nature of the problem, is not con- 
venient in practice. A great simplification would come 
from reducing such a multi-dimensional problem to a one- 
dimensional one. As the 1-point PDF of the F process alone 
is known, it is possible to construct a FP equation whose 
solution is Pf(F,A); if F were a Markov process, the con- 
strained solutions of the ID FP equation would give the right 
solution for its first-upcrossing rate. However, there seems 
to be no simple reason to conclude that F is a Markov pro- 
cess: the components of a multi-dimensional Markov process 
can possess not the Markov property (see, e.g., the example 
given in Risken 1989, Section 3.5); moreover, it is not pos- 
sible to obtain a FP equation for F — Ai by integrating out 
all the other variables in the FP equation (1171) , as the drift 



and diffusion coefficients are not constant and then cannot 
be taken out of the integrals. 

The solution of the ID problem can anyway be consid- 
ered as an ansatz to the true solution; it can be checked 
a posteriori, by means of direct numerical calculations, 
whether such ansatz leads to a correct description of the 
exact first upcrossing rate. This will be done in next Sub- 
section, where it will be shown that the ID solution leads 
to an almost perfect description of the first upcrossing rate 
problem. The numerical demonstration of this fact is suf- 
ficient for the scope of this paper, but leaves the question 
open of the reason of such behaviour, and of the possible 
Markov nature of the F process. Such complex problems 
will be addressed in future work. 

To construct a FP equation whose solution, with ini- 
tial condition Pf(F,0) — 8d(F) and natural boundary 
conditions (Pf(F,A) and its -F-derivative vanish at infin- 
ity), is Pf(F, A) (as given in paper I), it is convenient to 
transform the Pf(F,A) distribution to a Gaussian distri- 
bution with variance A, P x (x,A). The x(F) transforma- 
tion for A = 1 is given in paper I for the F processes 
corresponding to ellipsoidal collapse (hereafter ELL) and 
third-order Lagrangian collapse (hereafter 3RD). To obtain 
the Pf(F,A) distribution and the x(F) transformation at 
any A, the following self-similarity property can be used: 
P(F/y r A)dF/y r A = const; this is also valid for the P x (x,A) 
distribution. Then, 



P F (F,A)dF 
x(F,A) 



= v / APf(-F/v / A, VjdF 
= VIx(F/VA,1) . 



(18) 



With this transformations, the unconstrained PDF of 
the transformed quantity x is virtually identical to the dis- 
tribution of the linear density contrast Si (this is true at the 
1-point level; the N-point PDFs of the transformed quan- 
tities x at different points will generally not be Gaussian 
multivariates). The FP equation which admits P x (x,A) as 
a solution is obviously that of a Wiener process: 

1 d 2 

(19) 



d_ 

M' IV "'"' ~ 2dx 2 
Transforming back to the F variable, the FP equation for 
the Pf(F,A) distribution is obtained; its drift and diffusion 
coefficients are: 



D 



(i) 



D 



(2) 



dx 
dA "* 

1 / dx V 

2 \~dF~) 



1 d 2 x f dx V 

2 9F2 \~dF~) 



\dFj 



(20) 



If the transformation is linear, x — aF + b\/A, then the two 
coefficients become: 



D 



(i) 



D 



(2) 



(21) 



2aV~A 
1 

2a? ' 

However, it is convenient to work directly with the x process, 
whose FP equation is much simpler. 

To find the distribution of the trajectories which have 
not upcrossed a given threshold F c , the FP equation for F 
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Figure 1. The x c (A) barriers for the ELL and 3RD predictions 
(heavy lines), together with their 'linear' part (light lines). 



has to be solved with the boundary condition Pf{F c , A) = 
0. Transforming this condition back to the x variable, as 
the F — * x transformation is time-dependent, the absorbing 
barrier for the x process will move as A grows. Then, the 
MF problem is reduced to a Wiener diffusion problem with 
a moving absorbing barrier. 



3.2 The solution of the Moving Barrier Problem 

The diffusion problem with a fixed absorbing barrier, equiv- 
alent to the solution of the FP equation (191) with bound- 
ary condition P x (x c ,A) — 0, and with initial condition 
P x (x,0) = Sd(x), has a solution that has long been known 
(Chandrasekhar 1943), which can be obtained in the fol- 
lowing way: the initial condition is changed to P x (x, 0) = 
Sd(x) — Sd(2x c — x), i.e. a negative image is put in a po- 
sition symmetric with respect to the barrier. In the subse- 
quent evolution, the boundary condition is satisfied at any 
time by symmetry. It is easy to see that the initial condi- 
tion just shown leads to equation (^) as a solution (with x 
instead of F). This solution formally turns negative beyond 
x c ; the true solution is obviously null there. Note also that 
no meaningful solution exists if x c < 0: all the trajectories 
are absorbed from the start. 

In the moving barrier problem (x c = a; c (A)) the image 
method cannot be applied. In fact, a moving barrier problem 
is equivalent to a diffusion problem with non-null drift and 
a fixed barrier; this is the case for the F(A) diffusion. In 
this case, any negative image, put in a symmetric position 
with respect to the barrier, ought to move in a specular way 
with respect to the positive component, in order to ensure 
the PDF to be null at the barrier at any A. Thus, the image 
ought to move with a drift which is opposite in sign to the 
drift of the positive component; as a consequence, its PDF 
would not be a solution to the FP equation; it would be the 
solution to another FP equation, with drift of opposite sign. 

Appendix A contains a different form of the FP equa- 



tion, in which the barrier condition is implicitly contained 
in the drift and diffusion coefficients. Moreover, a solution is 
given in terms of a path integral. Having found no simple an- 
alytical solution, a numerical integration of the FP equation 
has been performed. Equation ( |19| ) has been integrated by 
means of the Cranck-Nicholson method, which consists in a 
finite-interval integration, stabilized by an artificial numeri- 
cal viscosity (see Press et al. 1992 for details). The goodness 
of the result depends on the parameter a = AA/2(A:r) 2 ; 
the result is stable for any a, but the small-scale features 
are better represented if a < 1. The following finite inter- 
vals have been chosen: Ax = 7.5 10 -3 , AA = 5. 10 -5 , which 
leads to a = 0.444; this is quite adequate, as the resulting 
distributions are very smooth. 

As shown in paper I, the x(F) transformation is ac- 
curately linear in F when F is larger than 1; this is true 
especially for ELL. In this case, the absorbing barrier is: 



x c (A) = 1.82F C - 0.69VA (ELL) 
x c (A) = 2.07F C - 1.02\/A (3RD) 



(22) 



Note that linear barriers are not linear in A! In the following, 
the F c = 1 barriers will be considered; any other F c value 
can be obtained by a A rescaling: A —> F^A. Fig. 1 shows the 
x c (A) barriers, based on both linear and non-linear F — > x 
transformations. The non-linear ELL barrier is accurately 
reproduced by the linear one up to moderately large A val- 
ues, while the non-linear 3RD barrier significantly departs 
from the linear one beyond A=l. 

The numerical solutions found with linear barriers have 
been compared with those found by simply inserting the 
x c (A) function into the fixed barrier solution, equation (^); 
these functions are denoted by P^{x, A). It turns out that 
the ratio between the two solutions is accurately fit by: 

P:° up (x,A)/P?(x,A) = J(x,A) = l+exp(/(A)x+ 5 (A)) .(23) 

Fig. 2a shows the numerical P^ onp (x, A) and the P* h (x,A) 
at A=l, for the linear ELL transformation; Fig. 2b shows the 
quantity ln(P" oup (x, A)/P^(x, A) - 1), which is accurately 
linear up to errors due to numerical precision. Figs 3a and 
b show the same for the 3RD prediction. 

To determine the / and g functions, note that the result- 
ing P"° up (a;, A) distribution is a solution of the FP equation 
(p.9[); thus the analytical fit has to reproduce its first deriva- 
tive in t and the second derivative in x. It is useful to obtain 
the n(A) in terms of the fixed-barrier solution P^°(x, A) and 
a correction term J(x,A); then the / and g functions can 
be tuned so as to give the correct J . The integral MF can 
be written as: 



/x c (A) 
PS oup (x,A)dx 
- oo 

r x c (A) 

P^{x,A)J{x,A)dx . 



(24) 



7i(A)is then: 

dfl(< A) 



n(A) 



dA 



c(A) 



dP£ oup 
OA 



dx - 
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Figure 2. ELL prediction, linear barrier, SKS smoothing, a) P" oup (x,A) and Pj^a;, A) at A=l. b) The logarithm of their ratio minus 
1. c) The J correction factor, numerical result and analytical fit. d) The resulting n(A) with the analytical fit. 



1 

2„. 
1 dP? 



(A) 



d 2 P™ 



-dx - 



2 da 



dx 2 

J(x c (A),A) = 



(25) 



= -££EL e (-^^ 2 / 2A ^(x c (A),A). 
V27tA7 

The commutation between the integral and the time deriva- 
tive in the first passage is justified by the fact that the in- 
tegrand always vanishes at the upper integration limit. The 
second passage is because of the FP equation (|l9|). In the 
third passage, the term with ^-derivatives of J vanishes be- 
cause Px(%, A) vanishes at x = x c . 



This last expression has been compared to the numeri- 
cal result in order to tune the / and g functions. The best-fit 
expressions are: 



/(A) 



-0.5 + 2.41A" 



(26) 



g(A) = 2.23 - 4.90A" 1 

for the linear ELL transformation, and 



/(A) = -0.3-r-2.05A~ 1 ' 13 (27) 
g(A) = 1.15-4.25A -1 

for the linear 3RD transformation. Figs. 2c and 3c show 
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Figure 3. The same as Fig. 2 with 3RD prediction, linear barrier, SKS smoothing. 



the numerical and proposed analytical corrections, equation 
(p3|), for the linear ELL and 3RD transformations. Figs. 2d 
and 3d show the corresponding n(A); the agreement is ex- 
cellent. Note that this correction is valid as long as x c > 0; 
when the barrier becomes negative (this fact has no rele- 
vant meaning), the fixed-barrier solution vanishes, so that 
this procedure cannot be used any more. 

It may be useful to express the new MF in terms of 
the classical PS one, with a free S c parameter. Writing the 
absorbing barrier as a; c (A) = 6 c (l + (x c (A)/8 c — 1)), the n(A) 
can be written as: 

© 1996 RAS, MNRAS 000, 



^expf-^-^+f^-^Vl (28) 
Oc V 2A J 

= / s (A)xI(A). 

The 1 correction factor has been defined in the same way as 
in M95. 

The linear transformation is only an approximation, 
valid up to moderately large F values. The complete trans- 
formation shows a falling tail at low F values, which corre- 
sponds to a pronounced peak around F=0.5 (Figs. 8a and 
b of paper I). The existence of this peak is confirmed both 
by ELL and by 3RD, but the exact position is not consid- 
ered a robust feature; at those F values the convergence of 
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-2 2 

In A 



Figure 4. The ELL and 3RD re(A) 

the Lagrangian series is not guaranteed. If this falling tail is 
modeled as in paper I, then the absorbing barrier becomes: 

x c (A) = 1.82F C - 0.69\/A- (29) 
0.4\/A(erf(-7.5iyVA + 1.75) + 1) (ELL) 

x c (A) = 2.07F C - 1.82VX- 

0.75VA(erf(-3F o /VA+ 1.18) + 1) (3RD) . 

Fig. 4a shows the linear and complete ELL n(A), Fig. 4b 
shows the same for 3RD. The small-massQ part depends on 
the details of the complete transformation, especially in the 
3RD case; in the ELL case the effect is modest even at rather 
large A values. As the details of the Pf{F,A) distribution 
at low F values are not considered reliable, the low-mass 
part of the MF is not considered a robust prediction of the 
theory. This is especially true at very large A values: in this 
case the fit of the x(F) transformation, given in paper I and 
used to get equations (p9|), is not accurate; on the other 
hand, that part of the Pf (F, A) distribution is very uncer- 
tain. Nonetheless, note that the non-linear element added 
in the x(F) transformation has the effect of enhancing the 
n(A) function at moderate A values, with a corresponding 
loss of small- mass objects. This fact, which in some sense in- 
troduces a second scale-length in the MF (the first, M*, cor- 
responding to the peak of n(A)), is somehow similar to the 
small-scale cutoff found by Porciani et al. (1996) (their effect 
on the MF is much more dramatic) . It is thus confirmed that 
the introduction of dynamical non-Gaussianity can lead to 
a large number of objects with intermediate masses, at the 
expense of small-mass objects. 



t I freely use the word mass in this context to indicate the large- 
mass (small A) or small-mass (large A) part of the MF; the exact 
correspondence between the two quantities is examined in Section 
5. 




In A 



curves: linear and complete barriers. 



ELL 




-2 
InA 



Figure 5. SKS ELL and 3RD n(A) curves, with complete barri- 
ers. A PS curve is shown for comparison. 

The (non-linear) complete ELL and 3RD n(A) func- 
tions are shown in Fig. 5, together with the original PS one 
(with the factor of 2). ELL and 3RD agree reasonably well at 
intermediate resolutions, while ELL gives more large-mass 
objects, as 3RD slightly underestimates quasi-spherical col- 
lapses (see paper I). On the other hand, their small-mass 
behavior is dominated by the non-linear features of the x(F) 
transformation. Compared to the PS prediction, both ELL 
and 3RD curves (i) show large-mass tails shifted toward 
large masses, (ii) give about a factor e ~ 2.7 more objects 
than PS around A=0, and (iii) show steeper small-mass tails 
(especially 3RD) . Point (i) is in agreement with the findings 
of M95; point (ii), somehow worrisome (the PS curve is in 
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Figure 6. SKS ELL and 3RD n(A) curves, with complete barriers: FP and numerical solutions. 



agreement with N-body simulations), will be solved by the 
use of Gaussian filters. 

Finally, the solutions found by solving the moving bar- 
rier problem have to be compared to the numerical solutions 
of the exact multi-dimensional problem, as explained in sec- 
tion 3.1, to decide on the validity of the FP solutions. The 
numerical solutions can be obtained by means of the Monte 
Carlo calculations already presented in paper I. Realizations 
of the initial potential are obtained in cubic grids of 16 3 or 
32 3 points; such realizations are smoothed by means of a 
hierarchy of SKS filters, having care to sample the fc-space 
so as to follow exactly the spacing of the modes of the cubic 
box. For any smoothing (116 for the 16 3 realizations, 464 
for 32 3 ones), the collapse time is calculated for any point 
which has not collapsed yet at smaller resolutions, and the 
upcrossing rate is directly calculated. The resolution range 
of such calculations is limited, so that, to cover a significant 
range, it is necessary to perform different sets of realizations 
with different normalizations. Three sets of 30 different 32 3 
realizations, with total variances 0.8, 2 and 5, and scale-free 
power spectra with slope —2, have been performed for the 
ELL case, while in the 3RD case, which is much more time 
consuming, four sets of 60 16 3 realizations, with variances 
0.8, 2, 5 and 12 have been used. 

Fig. 6 shows the comparison between the FP solutions 
and the direct numerical calculations. In the ELL case, the 
agreement between the FP and the numerical solutions is 
overall excellent. In the 3RD case, very delicate from the nu- 
merical point of view, the agreement is good at large masses, 
while the numerical curve is somehow below the FP one by 
~10% at intermediate masses. At small masses, In A > 1, 
a significant disagreement is visible; this is probably due to 
the fact that the analytical expression proposed in paper I 
for the Pf(F,A) distribution poorly fits the actual (numer- 
ical) distribution at small F values. This gives an idea of 
the dependence of the MF at small masses on the unreliable 
details of the dynamics of slowly collapsing points; a more 



detailed fit of the Pp (F, A) distribution is considered unwor- 
thy. It is however noteworthy how the small-mass decrease 
of the MF is emphasized by the numerical curve. 

In conclusion, it has been demonstrated that the mov- 
ing barrier problem gives an accurate solution of the multi- 
dimensional problem presented in Section 3.1. This fact per- 
mits on the one hand to solve the MF problem in a reason- 
able way, just by extending the diffusion formalism to the 
F process, and on the other hand it sheds some light on the 
nature of the F process as a function of the resolution; the 
question whether F possesses the Markov property will be 
faced by future work. 



4 GAUSSIAN SMOOTHING 

The elegant diffusion formalism presented above is strictly 
limited to SKS filtering. Any other filtering, which cuts the 
power spectrum in a non-sharp way, creates strong correla- 
tions in the F trajectories. This can be seen in Fig. 7, where 
a sample of nine (Markov) SKS and Gaussian trajectories 
are plotted (Gaussian trajectories are obtained by smooth- 
ing the same SKS trajectories; a scale-free power spectrum 
with n — —2 is used): while SKS trajectories are random 
walks, the Gaussian trajectories are strongly correlated, with 
a significant correlation length. This can be also seen as fol- 
lows: for a Gaussian process, the normalized correlation of 
the values of the process at different resolutions behaves as 
follows: 



(F(A)F(A + AA)) 
\J (F(A) 2 ){F(A + AA) 2 ) 



AM 



1 / AA\ 2 

2 v XT; 



(30) 



i.e. it is constant to first order in AA. A c is a s pectrum- 
dependent coherence length, equal to Ay / 2(3 + n) for scale- 
free spectra; in general: 



A c = 2A 7 (l- 7 2 )- 1/2 



(31) 
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Figure 7. Nine SKS and Gaussian trajectories; n = —2. 

where 7 is a standard spectral measure (see Bardeen et al. 
1986); 7 = ((n + 3)/(n + 5)) 1//2 for scale-free power spectra. 
In the SKS case, the A c scale vanishes, and the normalized 
correlation linearly decreases for small A variations. 

The main consequence of the strong correlation of tra- 
jectories in non-SKS smoothing is that, if a trajectory is 
just below the absorbing barrier, it cannot jump above it in 
a very small A interval, as in the SKS case. In the fixed bar- 
rier case, the PS formula without the factor of 2 is obtained 
at small A. Thus, the validity of the fudge factor 2 is limited 
to the very special case of SKS filtering; any other filtering 
gives a number of large-mass objects that is smaller by a 
factor of 2, and correspondly more small-mass objects, thus 
changing the shape of the MF. 

From a physical point of view, the stability of the F 
trajectories is a positive fact: the dynamical prediction of 
collapse is more stable when A varies. Gaussian filtering, in 
particular, has a number of merits: it is the most stable one 
(it minimizes the oscillations in the real and Fourier spaces; 
see also the comments in BCEK), and it optimizes the per- 
formances of dynamical predictions (Melott, Pellmann & 
Shandarin 1994; Buchert, Melott & Weifi 1994). Unfortu- 
nately, it is mathematically much easier to work with SKS 
filters, with which the F trajectories are the most noisy and 
least stable ones. 

The main problem with Gaussian-smoothed trajectories 
is that, as the filter does not cut the power spectrum in a 
sharp way, at a given A a trajectory contains information 
about the process at larger resolutions; then, the ip pro- 
cesses lose their Markov property. The Langevin equation 
for the </? processes, can be seen as an equation of motion 
with colored-noise forces. All the N-point correlations of tra- 
jectories at different resolutions are then decisive to obtain 
the ip PDF, which does not obey an FP equation. Obviously, 
all these features are valid also for the {F, <p} system, and 
for the F process itself, whose PDF will not obey any FP 
equation. Nonetheless, there is a motivated and successful 
way, proposed by PH for linear theory (F oc Si), to model 



Gaussian trajectories; this will be referred to as the PH ap- 
proximation. In this approximation, the trajectories are ap- 
proximated by a step process, constant in resolution over a 
scale of order A c ; different steps are assumed to be uncorre- 
cted. The correlation length was chosen by PH as 7rA c ln2 
(see their paper for details). The transition probability, from 
A' to A, of such a random step process can be written as: 



P(F,A;F',A') 



S D (F-F') 
P(F,A) 



if 
if 



A/ A' < A c 
A/A' > A c 



from which it is possible to obtain: 



P™ up (F, A)dF = J] / ' Pf ^ A ^ dF 



(32) 



(33) 



Taking a continuum limit over the logarithm of equation 
(p3[), the following expression is obtained (see PH and BCEK 
for details): 



fi(< A) = 1-P(F < Fc,A) 

xexpf [ lnP(F < F c , A') — 



(34) 



dA' 



A c (A')ln2 



The PH approximation has been shown, both in PH and in 
BCEK, to nicely represent the true PDF, numerically calcu- 
lated by simulating a large number of Langevin trajectories. 
A particular feature of Gaussian trajectories has to be noted: 
they explicitly depend, through their correlation length, on 
the power spectrum. 

The PH approximation, given the uncorrelated nature 
of the step process which it is based on, can be directly used 
to solve the moving barrier problem with Gaussian smooth- 
ing. Thus, expressing the integrals in equation ( |34| ) in the 
x(F) variable, 



n(A) 



exp(— xl/2A) I x 



2ttA V 2A 
ln(P(a; < x c ,A)) 



7rA c In 2 



dx c 
~dA 

fA 



+ P(x < X C ,A) 



(35) 



This expression can be compared to that obtained by means 
of a PS-like procedure, as the one followed in M95: 



fi(< A) = 1 - P(x < x c ,A) 



(36) 



this curve has been verified to coincide with the ELL M95 
one, if the ELL barrier is used (the small-mass part is bet- 
ter recovered here, as in M95 the asymptotic behavior of 
ellipsoidal collapse was forced to be that of the Zel'dovich 
approximation) . The PH and PS-like curves coincide at large 
masses, and are not very different overall (see Fig. 8). This 
is caused by the fact that the integral PS-like MF is a lower 
limit on the true integral MF (see BCEK), so, as only about 
10 per cent of the mass has to be redistributed, the difference 
between the two curves cannot be large, especially when the 
correlation length is large (large n). This is at variance with 
the SKS case, where many more objects are predicted to 
form at large masses, and this makes the n(A) curve be very 
different from the PS-like one. 

To test the validity of the PH approximation against 
direct numerical calculations, it is possible to follow two dif- 
ferent procedures. The first one, analogous to that used by 
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Figure 8. Comparison between the PH approximation and the numerical simulation of Langevin forces. Linear barriers have been used, 
a) ELL, n = -2. b) ELL, n = 1. c) 3RD, n = -2. d) 3RD, n = 1. 



Bond et al. (1991) (see also Risken 1989) consists in simulat- 
ing a large number of SKS x trajectories, smoothing them by 
means of a Gaussian filter and finally absorbing them with 
a moving x c (A) barrier. This procedure, valid in the case 
of Si(A) trajectories, is strictly correct (in the hypothesis 
that F is a diffusion process, as in Section 3) if the trajec- 
tories based on Gaussian-smoothed versions of the potential 
are the same as the Gaussian-smoothed SKS F trajectories; 
this is not verified in general, but only if collapse time and 
smoothing commute. In particular, as F(A) = T[ip* W(A)], 
smoothing (in Lagrangian space) and the functional T com- 
mute only if the functional is linear; this is the case for spher- 
ical collapse (linear theory with a threshold) . (Another nec- 



essary assumption is the invariance of the PDF with respect 
to filter shape; this is assured by the fact that smoothing is 
in Lagrangian space; see Bernardeau (1994)). A second and 
more correct procedure, analogous to the one proposed in 
the PH paper, is to repeat the same calculations performed 
in Section 3, based on the Monte Carlo realizations of paper 
I, with Gaussian filtering. The first procedure has the ad- 
vantage of allowing to set both the spacing and the range in 
resolution as wanted, while the second method can be used 
to assess the validity of the results obtained by means of the 
first method. 

The first procedure has been implemented as follows: 
2000 random increments have been simulated for each tra- 
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Figure 9. a): ELL SKS, Gaussian (n = —2 and 1) and PS-like curves, b): As a) with 3RD. c): Gaussian ELL and 3RD curves for n = —2; 
a PS curve with 8 C = 1.5 is shown for comparison, d): as c) with n = 1. 



jectory, in a resolution range from exp(— 4) to exp(4). The 
smoothed trajectories have been computed for 100-150 reso- 
lutions; their stability makes it unnecessary to use finer sam- 
plings. Scale-free power spectra, with n — —2 and —1, have 
been used; for larger n the PH and PS-like curves are so sim- 
ilar that it is difficult (and not useful) to decide which curve 
is better fit by the simulations. Fig. 8 shows the results for 
50000 trajectories, compared to the PH approximations and 
the PS-like curves, for linear ELL and 3RD absorbing barri- 
ers. The results have been rescaled to Ag, the variance of the 
Gaussian process, which, for scale-free spectra, is related to 
the SKS onebyA G = (n + 3)/2 P((n + 3)/2) Ask s (T is the 
usual Gamma function). The PH approximation accurately 



reproduces the n(A) curves, maybe slightly overestimating 
them around A=l; the asymptotic behaviors are correctly 
reproduced. The numerical curves are accurate enough to 
prefer the PH curves with respect to the PS-like ones, espe- 
cially at large A and for n = — 2. 

Figs. 9a and b compare the ELL and 3RD n(A) curves 
(complete barrier) obtained with SKS smoothing, Gaussian 
smoothing (PH approximation, n = —2 and —1) and the 
PS-like ones. The following things can be noted: 

(i) the Gaussian curves are below the SKS ones by a fac- 
tor of 2 in the large- and intermediate- mass part; this surely 
mitigates the problem noted above regarding the SKS curve 
at intermediate resolutions. 
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Figure 10. ELL prediction: comparison of the PH approximation 
with the direct simulation of the ELL collapse with Gaussian 
smoothing. 



(ii) The small-mass slope of Gaussian curves is less steep 
than the SKS ones. 

(iii) The dependence of the n(A) Gaussian curves on the 
spectrum is modest, and only slightly affects the small-mass 
part. 

Figs. 9c (n = —2) and d (n = 1) show the ELL and 
3RD Gaussian n(A) curves, compared to a PS curve with 
a 5 C = 1.5 value. All the conclusions given for SKS curves, 
on the asymptotic behaviors, remain valid, but this time 
the central part of the MF is just slightly above the PS 
one. Moreover, the Gaussian curves are similar to the PS 
curve with a lower 5 C value; this confirms the findings of 
M95. However, any comparison with a PS curve, considered 
as a fit to N-body simulations, is just qualitative, as the 
objects which are described here can be different from the 
friends-of-friends halos which are usually extracted from the 
simulations. 

The second numerical method has been implemented 
as follows: the 32 3 Monte Carlo realizations of ip have been 
smoothed by a hierarchy of Gaussian filters, then absorbing 
the obtained trajectories with a barrier at F c = l. At variance 
with the SKS calculations presented in Section 3, the sam- 
pling in A does not have to be very fine, as trajectories are 
much stable; the same resulting n(A) curves are much more 
stable than the SKS ones. Two sets of simulations with dif- 
ferent normalizations have been used to cover a significant 
range in A. Fig. 10 presents the resulting n(A) for the two 
sets of 30 realization; the ELL prediction has been used. 
The PH curve is found to reproduce the numerical curve in 
a satisfactory way. Its validity is then confirmed. 



5 FROM RESOLUTION TO MASS 

The quantities considered up to now, namely fi(< A) and 
n(A), depend on the resolution A. To determine the MF, a 



relation between A and the mass M is required. The col- 
lapsed medium gathers in clumps, which are identified as 
structures, provided they are reasonably separate in real 
space; it is necessary to determine how the mass is dis- 
tributed among those clumps. The simplest hypothesis is 
that a single mass forms at every A: M — M (A); it is then 
reasonable to assume this mass to be proportional to the 
cube of the smoothing scale: M oc P-R 3 , as Rf is the rel- 
evant characteristic scale for the forming clump. The pro- 
portionality constant can be obtained by connecting M to 
the mass contained in the smoothing filter, as in PS and in 
many relevant papers, or can be left as a free parameter, as 
in Lacey & Cole (1994). In the peak approach the situation 
is inverted with respect to the PS ansatz: the number of 
objects is clearly connected to the number of peaks above a 
given threshold, but the mass associated to a peak, and then 
the normalization of the MF itself, is not clearly determined. 

With the above assumption, the MF can be easily cal- 
culated as follows: 



n(M)dM = j^n(A(M)) 



dA(M) 



dM 



dM 



(37) 



As the A — > M transformation is a simple functional rela- 
tion, all the conclusions given above about the n(A) quantity 
are valid for n(M). For scale-free power spectra, A(M) is: 

-(3+n)/3 



A = (M/M, 



(38) 



where M* is the mass corresponding to unit variance; note 
that M, is different, by a factor of (2/<5 c ) 3/(3+n) , from the 
usual PS M* parameter used in the literature; since this the- 
ory has no S c parameter, that factor has been omitted. Figs. 
11a (n = —2) and b (n = 1) show the MFs as predicted by 
ELL and 3RD (with Gaussian smoothing), and a PS curve 
with S c = 1.5 for comparison; because of self-similarity, the 
MFs are given as a function of M/M*. Spectra have been 
normalized by assuming unit variance over a top-hat radius 
of 8/i _1 Mpc. Note how the differences between the n(A) 
curves are much less visible in the MFs, especially for small 
spectral indexes; this is mainly due to the huge dynamical 
range spanned by the MFs. 

The reasonable assumption of a simple functional rela- 
tion between mass and resolution is just an approximation 
of what really happens. Suppose that a whole distribution 
of masses is assigned to a given resolution: 



A^p(M;A) 



(39) 



p giving the probability of having a mass M from a res- 
olution A. Then the differential MF can be calculated as 
follows: 



n(M)dM 



M 



n(A)p(M, A)dA dM 



(40) 



i.e., the n(A) curve is convolved with the distribution of the 
masses forming at a given resolution. 

The physical origin of this p distribution can be ex- 
plained as follows. The real collapsed regions are related to 
the excursion sets of F above the F c threshold, together with 
the points which are predicted to collapse at smaller A. At 
the large mass end, since F is roughly proportional to the 
density contrast (spherical collapse is asymptotically recov- 
ered at small F values) , the excursion sets are characterized 
by isolated, simply connected regions in Lagrangian space, 
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Figure 11. MFs for scale-free power spectra (a):n = —2, b) n = 1), as a function of M/M*: ELL and 3RD predictions with Gaussian 
smoothing, and PS with <5 C = 1.5. M12 = 10 12 M©. 



each containing a single peak (see, e.g., Adler 1981; Bardeen 
et al. 1986). It is then natural to assume that each region 
ends up in an isolated clump. These clumps will not all have 
the same mass; rather, the masses will trace a given p dis- 
tribution, peaked on a given mass scale proportional to the 
filter mass, and with a given width. Both mean value and 
dispersion will in general depend on the power spectrum and 
on the shape of the filter (in this case Gaussian niters are 
to be preferred, as excursion sets are much more stable). At 
smaller resolutions the situation is considerably more com- 
plex, as the topology of the excursion sets is multiply con- 
nected. It is thus necessary to give a prescription to fragment 
the medium into isolated clumps. As a consequence, the p 
distribution will depend on the way the collapsed medium 
fragments into clumps, and its shape will probably not be 
as simple as before. In conclusion, the introduction of this 
p distribution is not expected to dramatically influence the 
MF at the large mass end; on the other hand, it is likely to 
influence significantly the shape of the small-mass part. 

While the punctual nature of the dynamical prediction 
makes it possible to avoid taking spatial correlations explic- 
itly into account in determining n(A), the statistical spatial 
properties of the F process are relevant for determining how 
the mass gathers into collapsed structures. A precise analyti- 
cal determination of the distribution p(M; A) is prohibitive: 
even for Gaussian processes, only mean values of the ex- 
tension of the excursion sets can be obtained (Adler 1981). 
Thus, this quantity can only be quantified through Monte 
Carlo simulations and careful comparisons with extended 
N-body simulations. This will be done in paper III of this 
series. 



6 SUMMARY AND CONCLUSIONS 

In this paper, the statistical tools, needed to determine an 
MF from the PDF of the inverse collapse times given in 



paper I, have been constructed. Given any inverse collapse 
time prediction F — l/b c (the "time" variable b being the 
linear growing mode) , calculated by means of any dynamical 
approximation of truncated type, it has been demonstrated 
numerically that, if the smoothing of the initial field is SKS, 
the fraction of collapsed mass at A, Q(< A), which is given 
by the solution of a multi- (infinite-) dimensional FP equa- 
tion, can be calculated by extending the diffusion formal- 
ism of Bond et al. (1991) to the F process; the absorbing 
barrier is set at F c , the inverse of the time at which the 
MF is required. This procedure corresponds to the following 
interpretation: when a point is collapsed according to the 
prediction relative to a resolution A, it is considered to be 
collapsed at any larger resolution, even though the predic- 
tion F(A) does not explicitly give collapse. Note that, due to 
the punctual nature of the collapse prediction, the 1-point 
PDF of F suffices in determining the fraction of collapsed 
mass. 

The F process, assumed to be a Markov process, can 
be transformed to a Gaussian Wiener process x - a random 
walk - such that the P x (x, A) distribution is a Gaussian with 
variance A; this transformation has already been found in 
paper I. In this way, the fixed absorbing barrier problem 
for F is transformed into a moving absorbing barrier prob- 
lem for x. This problem has been numerically solved, and 
a good analytical approximation has been found in the rel- 
evant cases for which the F — > x transformation is linear; 
this is the case for the large- F parts of the Pf {F, A) distri- 
butions for 3rd-order Lagrangian perturbation theory and 
ellipsoidal collapse. Finally, it has been numerically verified 
that the solution of the moving barrier problem gives a cor- 
rect description of the real upcrossing rate of the F process. 

When the smoothing is not SKS, the F(\) process is 
not a diffusion process, as different scales are mixed by the 
non-sharp truncation of the power spectrum. In this case, 
the F process is much more stable, so that a good approxi- 
mation of it can be obtained by considering it constant over 
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its correlation length. Thus, the useful approximation de- 
veloped by Peacock & Heavens (1990) can be used. The PH 
approximation adequately reproduces the numerical simu- 
lations (performed by means of two different methods) of 
Langevin trajectories absorbed by a moving barrier. The 
advantages of dealing with Gaussian smoothing are its phys- 
ical meaning (Gaussian smoothing is usually recommended 
when using truncated dynamical approximations; see paper 
I) and its stability with respect to A. The obvious disad- 
vantage is that it is much more complicated to solve the 
absorbing-barrier problem. 

To determine the number of collapsed objects from the 
fraction of collapsed mass at a given A, information is needed 
about how the collapsed points gather together in extended 
structures. This is where the spatial correlations of the F 
process are decisive. A first approximation can be given by 
assuming that a mass equal to that contained in the smooth- 
ing filter is formed. More realistically, a whole distribution 
p(M; A) of masses has to be associated to every resolution; 
this distribution, which will depend on the power spectrum, 
is likely to affect the small-mass part of the MF consider- 
ably. A purely analytical determination of the p distribution 
is problematical even in the case of Gaussian processes; its 
study is postponed to the forthcoming paper III. 

The following final conclusions can be drawn on the MF: 

(i) A larger number of large-mass objects is expected to 
form with respect to the simple PS prediction; this agrees 
with the conclusions of M95. 

(ii) The large-mass part of the MF is robust with respect 
to the dynamical prediction: different reasonable dynamical 
predictions give similar results. The ELL prediction tends 
to give more objects than the 3RD one in the large-mass 
tail; this is due to the fact that 3rd-order Lagrangian the- 
ory slightly underestimates spherical collapse; thus the ELL 
prediction is considered more believable in that range. 

(iii) Explicit, reasonably accurate, analytical solutions 
are given for the large-mass parts of the SKS and Gaussian 
MFs. 

(iv) When Gaussian smoothing is used, the resulting MFs 
give fewer objects by roughly a factor of 2 in the large-mass 
part than the SKS one. This makes the Gaussian curves very 
similar to the PS one with a S c — 1.5 parameter. 

(v) The Gaussian MFs are preferred to the SKS ones be- 
cause Gaussian smoothing optimizes the dynamical predic- 
tions, stabilizes the trajectories, and lowers the peak of the 
MF. 

Note that the dynamical predictions analyzed here, par- 
ticularly the 3rd-order Lagrangian one, tend to produce 
more intermediate-mass objects, at the expense of the small- 
mass ones, somehow introducing a second characteristic 
scale in the MF. 

The small-mass part of the MF is not considered a ro- 
bust prediction of the theory, for at least three reasons: 

(i) The definition of collapse, given in paper I, which is 
based on the concept of orbit crossing, is not expected to 
reproduce common small-mass structures like virialized ha- 
los. OC regions rather represent those large-scale collapsed 
environments in which the virialized halos are embedded. 

(ii) All the dynamical predictions given in paper I are 
considered good as long as the inverse collapse time is not 



small. Thus, the small-mass part of the MF is based on non- 
robust dynamical predictions. 

(iii) The p distribution of the forming masses, at a given 
resolution, is likely to dramatically affect the small-mass 
part of the MF. 
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The first term of the right hand side,linear in P, can be 
interpreted as a branching term which kills the trajectories 
upcrossing the barrier (see Cavaliere et al. 1996). The solu- 
tion of this equation can be written in terms of a functional 
integral; it is easy to show that (see Risken 1989, Section 
4.4.2): 



P™ up (a; ,A) = lim / dx? 



^c(A i+ i) 

dxi 



iV — 1 

11 ,/9^ 



exp 



(x l+1 - Xj) 2 

2dA 



dxo 



(A6) 



Unfortunately, the variable upper limits make this integral 
very hard to solve. 

This paper has been produced using the Royal Astronomical 
Society/Blackwell Science IATjrjX style file. 



APPENDIX A: ALTERNATIVE 
FORMULATION OF THE FP EQUATION 



Equation ([f9|), with a moving absorbing barrier, can be ex- 
pressed in a form which includes the barrier condition in 
the drift and diffusion coefficients; this can help in finding 
analytical solutions. Let £(t) be a Wiener process, and let x 
be the value it takes at time t (the process was called x(i) 
in the text; t corresponds to A). The P x (x, t) PDF obeys an 
evolution equation of the kind: 



dP 
~dt 



E 

71 = 1 



( dx) 



D (n) (x,t)P . 



(Al) 



Drift and diffusion coefficients are the first two of the 
series; all the other coefficients vanish for continuous Markov 
processes. The coefficients can be found by means of the 
Kramers- Moyal expansion (Risken 1989): 

1 1 f 

— — lim — — / (x — x') n P(x, t + At\x' , t)dx . 
n\ At-,o At J y ' K ' 1 ' ' 

These coefficients can be easily calculated by integrating 
the Gaussian transition probability: 



P(x, t + At\x' , t) = exp( — (x — x 



(A3) 



in the relevant integration range, i.e. from -co to x c (t). The 
result of this integration is: 



D w (x,t) 
D (2) (x,t) 

i(n) 



-5 D (x c (t) - x) 
0{x c (t) - x)/2 



(A4) 



D w {x,t) = 



n > 2 



The interpretation of this result is simple: the upcrossing of 
the barrier is contrasted by an infinite discontinuous drift, 
while diffusion is switched off beyond the barrier. With some 
algebraic manipulation, the equation can be written as: 



dt 2 x- xc(t) 2 v w ' dx 2 



(A5) 
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